智能论文笔记

Rich Event Modeling for Script Event Prediction

Long Bai , Saiping Guan , Zixuan Li , Jiafeng Guo , Xiaolong Jin , Xueqi Cheng

分类：自然语言处理 | 人工智能

2022-12-16

Script is a kind of structured knowledge extracted from texts, which contains a sequence of events. Based on such knowledge, script event prediction aims to predict the subsequent event. To do so, two aspects should be considered for events, namely, event description (i.e., what the events should contain) and event encoding (i.e., how they should be encoded). Most existing methods describe an event by a verb together with only a few core arguments (i.e., subject, object, and indirect object), which are not precise. In addition, existing event encoders are limited to a fixed number of arguments, which are not flexible to deal with extra information. Thus, in this paper, we propose the Rich Event Prediction (REP) framework for script event prediction. Fundamentally, it is based on the proposed rich event description, which enriches the existing ones with three kinds of important information, namely, the senses of verbs, extra semantic roles, and types of participants. REP contains an event extractor to extract such information from texts. Based on the extracted rich information, a predictor then selects the most probable subsequent event. The core component of the predictor is a transformer-based event encoder to flexibly deal with an arbitrary number of arguments. Experimental results on the widely used Gigaword Corpus show the effectiveness of the proposed framework.

translated by 谷歌翻译

DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization

Xiangyu Xu , Li Guan , Enrique Dunn , Haoxiang Li , Guang Hua

分类：计算机视觉

2022-12-08

In this paper, we propose an end-to-end framework that jointly learns keypoint detection, descriptor representation and cross-frame matching for the task of image-based 3D localization. Prior art has tackled each of these components individually, purportedly aiming to alleviate difficulties in effectively train a holistic network. We design a self-supervised image warping correspondence loss for both feature detection and matching, a weakly-supervised epipolar constraints loss on relative camera pose learning, and a directional matching scheme that detects key-point features in a source image and performs coarse-to-fine correspondence search on the target image. We leverage this framework to enforce cycle consistency in our matching module. In addition, we propose a new loss to robustly handle both definite inlier/outlier matches and less-certain matches. The integration of these learning mechanisms enables end-to-end training of a single network performing all three localization components. Bench-marking our approach on public data-sets, exemplifies how such an end-to-end framework is able to yield more accurate localization that out-performs both traditional methods as well as state-of-the-art weakly supervised methods.

translated by 谷歌翻译

KATSum: Knowledge-aware Abstractive Text Summarization

Guan Wang , Weihua Li , Edmund Lai , Jianhua Jiang

分类：自然语言处理 | 人工智能

2022-12-06

Text Summarization is recognised as one of the NLP downstream tasks and it has been extensively investigated in recent years. It can assist people with perceiving the information rapidly from the Internet, including news articles, social posts, videos, etc. Most existing research works attempt to develop summarization models to produce a better output. However, advent limitations of most existing models emerge, including unfaithfulness and factual errors. In this paper, we propose a novel model, named as Knowledge-aware Abstractive Text Summarization, which leverages the advantages offered by Knowledge Graph to enhance the standard Seq2Seq model. On top of that, the Knowledge Graph triplets are extracted from the source text and utilised to provide keywords with relational information, producing coherent and factually errorless summaries. We conduct extensive experiments by using real-world data sets. The results reveal that the proposed framework can effectively utilise the information from Knowledge Graph and significantly reduce the factual errors in the summary.

translated by 谷歌翻译

NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection

Ruihao Zhou , Li He , Hong Zhang , Xubin Lin , Yisheng Guan

分类：机器人

2022-09-26

循环闭合检测是在复杂环境中长期机器人导航的关键技术。在本文中，我们提出了一个全局描述符，称为正态分布描述符（NDD），用于3D点云循环闭合检测。描述符编码点云的概率密度分数和熵作为描述符。我们还提出了快速旋转对准过程，并将相关系数用作描述符之间的相似性。实验结果表明，我们的方法在准确性和效率上都优于最新点云描述符。源代码可用，可以集成到现有的LIDAR射测和映射（壤土）系统中。

translated by 谷歌翻译

Causal Coupled Mechanisms: A Control Method with Cooperation and Competition for Complex System

Xuehui Yu , Jingchi Jiang , Xinmiao Yu , Yi Guan , Xue Li

分类：人工智能

2022-09-15

复杂的系统在现实世界中无处不在，并且往往具有复杂且理解不足的动态。对于他们的控制问题，挑战是保证在这种肿的和陷入困境的环境中的准确性，鲁棒性和概括。幸运的是，复杂的系统可以分为人类认知似乎可以利用的多个模块化结构。受到一种新型控制方法的启发，提出了一种新颖的控制方法，是一种因果关系机制（CCMS），它提出了探索组合分裂和竞争的合作。我们的方法采用了层次强化学习理论（HRL），其中1）具有竞争意识的高级政策将整个复杂系统划分为多种功能机制，以及2）低级政策完成了每种机制的控制任务。特别是用于合作的级联控制模块有助于CCM的串联操作，并使用向前耦合的推理模块来恢复分区过程中丢失的耦合信息。在合成系统和现实世界的生物调节系统上，CCM方法即使有不可预测的随机噪声，CCM方法也可以达到稳健和最新的控制结果。此外，概括结果表明，重复使用准备的专业CCM有助于在具有不同混杂因素和动态的环境中表现良好。

translated by 谷歌翻译

PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

Han Wang , Jun Tang , Xiaodong Liu , Shanyan Guan , Rong Xie , Li Song

分类：计算机视觉 | 人工智能

2022-09-06

近年来，人们见证了应用上下文框架以提高对象检测作为视频对象检测的性能的趋势。现有方法通常一次汇总功能以增强功能。但是，这些方法通常缺少来自相邻帧的空间信息，并且缺乏功能聚合不足。为了解决这些问题，我们执行一种渐进式方式来引入时间信息和空间信息以进行集成增强。时间信息由时间特征聚合模型（TFAM）引入，通过在上下文框架和目标框架之间进行注意机制（即要检测到的框架）。同时，我们采用空间过渡意识模型（StAM）来传达每个上下文框架和目标框架之间的位置过渡信息。我们的PTSeformer建立在基于变压器的检测器DETR上，还遵循端到端的方式，以避免重大的后处理程序，同时在Imagenet VID数据集上获得88.1％的地图。代码可在https://github.com/hon-wong/ptseformer上找到。

translated by 谷歌翻译

Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification

Emadeldeen Eldele , Mohamed Ragab , Zhenghua Chen , Min Wu , Chee-Keong Kwoh , Xiaoli Li , Cuntai Guan

分类：机器学习

2022-08-13

学习时间序列表示只有未标记的数据或几个标签样本可用时，可能是一项具有挑战性的任务。最近，通过对比，通过对比的不同数据观点从未标记的数据中提取有用的表示形式方面，对对比的自我监督学习表现出了很大的改进。在这项工作中，我们通过时间和上下文对比（TS-TCC）提出了一个新颖的时间序列表示学习框架，该框架从未标记的数据中学习了具有对比性学习的无标记数据的表示。具体而言，我们建议时间序列特定的弱和强大的增强，并利用他们的观点在拟议的时间对比模块中学习稳健的时间关系，除了通过我们提出的上下文对比模块学习判别性表示。此外，我们对时间序列数据增强选择进行系统研究，这是对比度学习的关键部分。我们还将TS-TCC扩展到了半监督的学习设置，并提出了一种类感知的TS-TCC（CA-TCC），从可用的少数标记数据中受益，以进一步改善TS-TCC学到的表示。具体而言，我们利用TS-TCC生成的强大伪标签来实现班级感知的对比损失。广泛的实验表明，对我们提议的框架所学的功能的线性评估与完全监督的培训相当。此外，我们的框架在少数标记的数据和转移学习方案中显示出高效率。该代码可在\ url {https://github.com/emadeldeen24/ts-tcc}上公开获得。

translated by 谷歌翻译

A Cooperative Positioning Flamework for Robot and Smart Phone Based on Visible Light Communication

Junye Chen , Fangdi Li , Futong An , Chen Yang , Hongzhan Song , Shangsheng Wen , Weipeng Guan

分类：机器人

2022-08-11

提出了基于可见光通信（VLC）的人类和机器人的合作定位火焰。根据实验系统，我们证明它具有很高的精度和实时性能。

translated by 谷歌翻译

Graph Signal Processing for Heterogeneous Change Detection Part II: Spectral Domain Analysis

Yuli Sun , Lin Lei , Dongdong Guan , Gangyao Kuang , Li Liu

分类：计算机视觉

2022-08-03

这是本文的第二部分，为异质变化检测（HCD）问题提供了新的策略，即从图形信号处理（GSP）的角度解决HCD。我们构造一个图表以表示每个图像的结构，并将每个图像视为图表上定义的图形信号。这样，我们可以将HCD问题转换为图表上定义的系统的信号响应的比较。在第一部分中，通过比较顶点域的图之间的结构差来衡量变化。在本第二部分中，我们分析了来自光谱域的HCD的GSP。我们首先分析同一图上不同图像的光谱特性，并表明它们的光谱表现出共同点和差异。特别是，正是变化导致了光谱的差异。然后，我们提出了HCD的回归模型，该模型将源信号分解为回归信号并更改信号，并且需要回归的信号具有与同一图上的目标信号相同的光谱属性。借助图光谱分析，提出的回归模型是灵活且可扩展的。对七个真实数据集进行的实验显示了该方法的有效性。

translated by 谷歌翻译

Graph Signal Processing for Heterogeneous Change Detection Part I: Vertex Domain Filtering

Yuli Sun , Lin Lei , Dongdong Guan , Gangyao Kuang , Li Liu

分类：计算机视觉

2022-08-03

本文为异构变化检测（HCD）问题提供了一种新的策略：从图形信号处理（GSP）的角度解决HCD。我们为每个图像构造一个图表以捕获结构信息，并将每个图像视为图形信号。通过这种方式，我们将HCD转换为GSP问题：对两个图上定义的不同系统的响应的比较，试图找到结构性差异（第I部分）和信号差异（第II部分）异质图像之间的变化。在第一部分中，我们用顶点域的GSP分析了HCD。我们首先证明，对于未改变的图像，它们的结构是一致的，然后在两个图上定义的系统上的相同信号的输出相似。但是，一旦区域发生变化，图像的局部结构会发生变化，即包含该区域的顶点的连通性发生变化。然后，我们可以比较通过在两个图上定义的过滤器的相同输入图信号的输出信号以检测更改。我们设计了来自顶点域的不同过滤器，可以灵活地探索原始图中隐藏的高阶邻域信息。我们还从信号传播的角度分析了变化区域对变化检测结果的有害影响。在七个真实数据集上进行的实验显示了基于顶点域滤波的HCD方法的有效性。

translated by 谷歌翻译